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Cafibration is nowadays one of the most important processes 
involved in the extraction of valuable data from measurements. 
The current availability of an optimum data cube measured 
from a heterogeneous set of instruments and surveys relies 
on a systematic and robust approach in the corresponding 
measurement analysis. In that sense, the inference of configurable 
instrument parameters can considerably increase the quality of 
the data obtained. 

Any measurement devoted to scientific purposes contains an 
element of uncertainty. The level of noise, for example, determines 
the limit of usability of an image. Therefore, a mathematical 
model representing the reality of the measured data should also 
include at least the sources of noise which are the most relevant 
ones for the context of that measurement. 
This paper proposes a solution based on Bayesian inference 
for the estimation of the configurable parameters relevant to 
the signal to noise ratio. The information obtained by the 
resolution of this problem can be handled in a very useful way 
if it is considered as part of an adaptive loop for the overall 
measurement strategy, in such a way that the outcome of this 
parametric inference leads to an increase in the knowledge of a 
model comparison problem in the context of the measurement 
interpretation. 

The context of this problem is the multi-wavelength measure- 
ments coming from diverse cosmological surveys and obtained 
with various telescope instruments. As a first step,, a thorough 
analysis of the typical noise contributions will be performed 
based on the current state-of-the-art of modern telescope instru- 
ments, a second step will then consist of identifying configurable 
parameters relevant to the noise model under consideration, 
for a generic context of measurement chosen. Then as a third 
step a Bayesian inference for these parameters estimation will 
be applied, taking into account a proper identification of the 
nuisance parameters and the adequate selection of a prior 
probability. Finally, a corresponding set of conclusions from the 
results of the implementation of the method proposed here will 
be derived 

I. Introduction 

As indicated in |3|, astronomical photometry is about the 
measurement of the brightness of radiating objects in the 
sky. Many factors, such as those coming from the instrument 
limitation, from a fixed measurement strategy, or the limitation 
from the mean through which the measurement is taking 



place, make this area of the science relatively imprecise. The 
improvement in the dectectors technology plays a key role in 
the area of optimizing the resulting astronomical photometric 
measurements. In this sense, a signal-to-noise ratio capable of 
being configured as part of an optimization framework of the 
measurement system seems to be a useful input. 
Charge-coupled devices (CCDs) constitutes the state-of-the-art 
of detectors in many observational fields. [2] enumerates the 
areas involved in the recent advances of the CCDs systems, 
which are: 

• Manufacturing standards that provide higher tolerances in 
the CCD process leading directly to a reduction in their 
noise output. 

• Increased quantum efficiency, especially in the far red 
spectral regions. 

• New generation controll electronics with the ability for 
faster readout, low noise performance, and more complex 
control functions. 

• New types of scientific grade CCDs with some special 
properties. 

Any data, in general, is always limited in accuracy and 
incomplete, therefore, deductive reasoning does not seem to 
be the proper way to prove a theory. However, and as said 
in 1 8 1, statistical inference provides a mean for estimating the 
model parameters and their uncertainties, which is known as 
data analysis. It also allows assessing the plausability of one 
or more competing models. 

The use of a Bayesian approach here is also justified in 
fSl where it is stated that for data with a high signal-to-noise 
ratio for example, a Bayesian analysis can frequently yield 
many orders of magnitude improvement in model parameter 
estimation, through the incorporation of relevant prior. This is 
exactly what we intend through the implementation of what 
will be described in this paper, and detailed in the following 
section. 

II. Description of the problem to be resolved 

The problem to be resolved here consists in the imple- 
mentation of bayesian inference for a set of configurable 



parameters which affect the signal-to-noise ratio of a mea- 
surement. In comparisson with other methodologies, such as 
ANOVA, which shows serious weakness when outliers are 
present in the measured data, bayesian parameter inference 
offers a robust method against outliers. It also allow to improve 
the results of inference by using the posterior probability 
density distribution (pdf) of one execution as the prior pdf 
for another execution in a recursive framework. This will lead 
to an adaptive measurement strategy which can be addressed 
as a calibration refinement. 

Professional surveys plan the measurement strategy well in 
advance, taking into account all the relevant factors impacting 
on the measurement; this involves the set of specified fix 
parameters from the detector and also a set of parameters 
which configure the measurement, such as integration time, 
diameter of the aperture, etc. 

Once a measurement has finished, the data are archived and 
their analysis and processing begin. The problem proposed 
here is to establish a link between the results of a measurement 
under a specific detector configuration and the refinement, by 
application of parameter bayesian inference, of the configura- 
tion parameters to be applied in a further measurement. The 
result of this bayesian inference at parameter level is proposed 
to be injected as additional knowledge for a model selection 
problem in the context of measurement data analysi (i.e, 
photometric cross-matching of multiwavelength astronomical 
sources). 

For example, let us imagine that we have performed colour 
measurement in a multi-wavelength survey with ten different 
instruments, each one under a specific configuration. Let us 
imagine that in the process of model comparison for the 
cross-matching scenarios, the existence of a source inferred 
in a bandwith which is not detecting it, is plausible. Then, 
based on this result, a new configuration for that instrument 
can be inferred in such a way that allows us to explore the 
refined plausability indicated by the model comparison from 
the data obtained in the first measurement loop. Figure [3] 
shows a block diagram reflect in general lines the idea of the 
problem proposed here 



III. State of the art of telescope detectors 

The great majority of detectors used in the astronomical 
field are Silicon-based ones; this means that the electronics 
involved in the specification, manufacturing and operational 
life of these detectors are relvant to the outcome obtained. 
As detailed in |3|, the excitations of electrons responding to 
incident photons constitutes the fundamentals for practical flux 
measurement in almost all nowadays photometric systems. 
In nowadays, CCDs are used in many instruments involved 
in the main current astronomical surveys; an extensive and 
increasing bibliography is currently being publishing. There- 
fore our paper will focussed on this type of instruments, 
however we have tried to retain the generic aspect of the 
characterization of any other type of detector and obviously 



the methology presented here is fully valid with any other set 
of specific parameters. 

A. Characterization of telescope detectors 

A summary from |3 |on the relevant information related to 
detector parameters has been included in this subsection for 
the sake of clarity. A more detailed information can be found 
in 13 1 

In general terms and for the context of our problem, a 
detector can be characterized by the following parameters, as 
indicated in 0: 

• Quantum Efficiency (Q): it is the ratio between regis- 
tered events and the incident photon. 

• Information Transfer Efficiency (E): it is the ration 
between the square of the signal-to-noise ratio of the 
output and the square of the signal-to-noise ratio of the 
input. 



E 



{S/N)l 



(1) 



• Noise equivalent power (NEP):it is defined as the optical 
power producing an output equal to the noise level of the 
detector. 

NEP = — (2) 

• Linearity and Saturation: one of the most relevant 
characteristics of a CCD detector is its linear property. 
This means that, under ideal scenario (ignoring effects 
such as, noise, dark current, polarization current, sky 
background contribution etc), the intentsity registered in 
each pixel (as elecrons) is proportional to the incident 
light. However this linear behaviour has its limits. The 
most obvious one is the Saturation threshold, which is 
measured by the Full-well capacity 

• Full- well capacity: this parameter measures the limit of 
the accumulated charge before saturation begins. This 
value is normally included in the technical specification 
of the detector's supplier. 

• Event or pixel capacity: it is defined as how many events 
can be usually accumulated before some saturation effect 
takes place. 

• Working range: also named Dynamic range, is essen- 
tially the same as event capacity for a noiseless accumu- 
lative single detector, but more generally interpreted as 
the difference between useful maximum and minimum 
event counts. 

• Gain(G): the digital image consists of a table of numbers 
which indicate the intensity registered in each pixel. 
However, the numbers stored do not mean the quantity 
of electrons found in each electrode, as this quantity can 
be huge and this would make the resultant storage files 
too big. Therefore what is normally done is to divide 
the quantity of electrons by a certain number, named 
Gain; thus, what we register in the file is the number 



of counts obtained when performed the above division. 
Sometimes counts are called ADU (Analog-to-Digital 
Units) or DN (Data Number). The Gain is therefore 
measured in electrons per count. 

Some cameras allow the user to choose the Gain. Then 
we could choose a small value for faint detections or bit 
to measure correctly sources of various brightness, but 
all this without overpassing the limit done by: 

pixelcapacity 

MaximumGam = (3) 

dynamicrange 

• Counts(c): The number of photons that fall on a pixel 
is related to the counts by: 

Photons —— (4) 

Detector parameters such as quantum efficiency, linearity 
event capacity and so on, are often characterized by figures 
of merit, which manufacturers quote about their products. 

B. Review of the main sources of noise in the photometry 
measurement 

The error which all scientific measurements should carry 
means really uncertainty and it is due to noise. Following the 
line of discussion presented at the begining of this section, 
once an electron has been excited by a photon, the next step 
consists in registering this event by the electronics of the 
detector. In this process, handling and reducing the number 
of extraneous electron activity which does not come from the 
source subject to the pure measurement (noise) consitutes a 
delicate and complex step of this process. A summary from 
(3 1 on the relevant information related to sources of noise has 
been included in this section for the sake of clarity. For more 
detailed information about the functional block description of 
any detector can be found in |3 1 

There are numerous sources of noises in the CCD images. The 
following list identify those sources of noise which are more 
relevant to the problem described in section HH 

• Dark current; also named Thermal noise, it is produced 
by the spontaneous generation of electrons in the Silicium 
(Si) due to the themal excitation of the material. 

The noise associated with dark current in a CCD is 
regarded as having a spatial dependence, in that it relates 
to minor irregularities in the solid state molecular bonding 
lattice, associated with material nterfaces and impurities. 
Each pixel generates a slightly different level of dark 
current, so the noise depends on non-uniformity of the re- 
sponse over the surface as well as the dark current's inher- 
ent quasi-Poissonian contribution to the electron counts. 
Dark current, following a generally Richardson law type 
dependence on temperature, could fill the potential wells 
of an uncooled CCD at 20 °C in, typically, a few seconds, 
but cooling to — 50 °C say, reduces this to a tolerable few 
electrons per pixel per second. The standards approach to 
spatial non-uniformity of response in CCD is through fiat 
fielding, although it should be rememberd that there is 



an additional shot noise contribution to the adopted flat 
field contribution. 

Within the CCD, each pixel has a slightly different gain 
or quantum efficiency when compared with its neighbors. 
In order to flatten the relative response for each pixel to 
the incoming radiation, a flat field image is obtained and 
used to perform this calibration. Ideally, a flat field image 
would consist of uniform illumination of every pixel by 
a light of source of identical spectral response to that of 
the object frames. Once a flat field image is obtained, one 
them simply divides each object frame by it implementing 
then instant removal of pixel-to-pixel variations. 

• Cosmic rays: they are part of the inhabitants of the inter- 
stellar space; cosmic rays are sub-atomic particles which 
go at very high speed, near to the speed of light. Cosmic 
rays are very annoying for the astronomers because they 
interact with the Silicium of the coupled charged devices. 
These electronic micro-aluds, concentrated in a few pixels 
appear in the images as points, or sometimes, lines very 
bright. As much the exposition time is, more quantity of 
cosmic rays will deteriorate it. 

• Read noise; this is a very important contribution to 
the toal noise of the images. This noise is due to the 
random and unavoidable errors that are produced during 
the reading of the image, in the process of amplification 
and counting of the electrons captured in each pixel. 
These errors are intrinsic to the nature of the detector 
device. 

The existance of read noise has always to be taken 
into account because it affects to all steps towards the 
obtainment and traitment of the digital images. Each 
camera must have its own level of read noise, documented 
in its technical specifications. 

• Shot noise: this is a statistical noise due to the inherently 
non-steady photon influx. A Poissonian distribution is 
normal for the arrival of the primary photon stream from 
a source of constant emissive power. 

• Clocking noise; this noise comes from the various high 
frequency oscillators involvd in the gating circuitry. This 
noise rises with load and clocking frequency, but it can 
normally be controlled by manufacturers to a negligible 
level for astronomical applications. 

• Atmosphere: The total noise of detection is not just 
that of the photoelectric effect on the detector, since the 
signal has already been deteriorated by the atmosphere. 
Further reading on the problems with atmosphere in the 
astronomical measurements can be found in |i411 



For simplification purposes in the resolution of the problem 
describe in [III only the following list of noise sources will be 
considered representative for the context of our problem. How- 
ever the methodology can be extended to a more exhaustive 
list of noises: dark current, read noise, background noise. 



IV. SiGNAL-TO-NOISE RATIO AND CONFIGURABLE 
PARAMETERS 

As detailed in f2l, a careful understanding of the main 
sources of uncertainties can suggest ways to improve our 
measurement strategy, this means, the observation, reduction 
and analysis processes. 

A crucial concept in photometry is the signal-to-noise ratio 
(S/N), which is equivalent to the concept of percentage error 

(N/S) X (100%) 

It is very important to assess and, if possible, to reduce the 
noise of the images.In this direction, the parameter S/N is 
very useful in the assessment of the feasibility, reliability and 
quality of the detection. As a general rule, and based on the 
considerations expressed in |5|, to get reliable photometry 
and/or astrometry measurements, the minimum threshold must 
be: S/N = 4 

As a first preliminary simplification, under the asumption of 
photon noise dominating the noise, the counting statistics of 
the number of photons impacting on a given area per second 
can be modelled by a Gaussian distribution, where the scatter 
is the square root of the number of photons, therefore: 



S/N . 



(5) 



Where n includes the photon counts for the sky foreground and 
the sky background, both of them carrying noise components. 
To obtain the count from the source alone, the sky background 
contribution has to be substracted. Then, considering this two 
contributions, we can write the following: 

S/N = ^^"""^^ (6) 

^y ^source H~ 2 X Cbackground 

The S/N ratio changes with the integration time, t and with 
the telescope aperture,I)te^ therefore these two parameters 
encompass the configuration domain by which the S/N ratio 
can be optimized in the process explained in section [III 
For the telescope aperture, we know that increasing the diam- 
eter of the telescope primary, I^^eZ, by a factor of 2 increases 
the collecting area by factor of 4, thus for a given integration 
time, t we get: {S/N)aDtei 

Regarding the integration time dependency with the signal-to- 
noise ratio, we can write: 

tRstar 



S/N 



s 



(7) 



ItRsky 

Where R is th count rate expressed in countss~^. Therefore, 
S/NaVt 

Similarly the number of exposures impact on the noise of 
the resulting image. According to | 5 1 if the exposure is broken 
into n equal short exposures, the error in the mean of the 
measurements would be: 

^individ 



71 



(8) 



Where (Jindivid is the scatter in the individual short exposures 
and rimeas is the number of such short exposures.lt is im- 
portant to keep in mind that if we add n images resulting 



from n short exposures, the resulting signal is S = si -\- 
S2 + ... + Sn, and total noise of the resulting image will 
be = Y^rf + rf + TTT+r^ As a conclusion, the signal- 
to-noise ratio of an addition of images from the same object 
is bigger and therefore better than the signal-to-noise ration of 
each individual image. 

So far, photon noise has been assumed to be the dominant 
contribution of the noise, and therefore the other noise sources 
contributions have been reduced to zero for the S/N equations 
above. However, in the case of faint sources detection the 
dark current and the read noise can play a key role in the 
S/N, therefore, we will develop the S/N equation, with 
the integration time dependency and including at least the 
following sources of noise: photon noise, dark current and read 
noise. For each of the noise sources, valid approximations will 
be considered in order to obtain a final S/N equation which 
is computational cost affordable for a intel mac Core 2 Duo 
computer. 

Therefore, the equation for the S/N of 3. measurement made 
with a CCD can be given by: 

N, 



S/N: 



(9) 



{Ns + TVd + N^r) 
Where: 

• N^ is the total number of photons which compound the 
signal detected from the object 

• Ns is the total number of photons coming from the 
background also called sky 

• Nd is the total number of dark current electrons per pixel 

• N^ is the total number of electrons per pixel resulting 
from the read noise. 

The noise terms in equation [9] can be modelled by Poisson 
distributions. The term Upix is used to apply each noise term 
on a per pixel basis to all the pixels involved in the S/N 
measurement. A more complete equation taking into account 
digitization noise within the A/D converter can be found in 12J 

As explained in [2J, and using the fact that S/N = 1/a, 3. 
standard error for the measurement can be obtained as: 



magnitude 



1.0857v/7V, +p 

TV* 



(10) 



where p is equal to the noise terms indicated in [51 and 
1.0857 is the correction term between an error in flux (elec- 
trons) and that same error in magnitudes (Howell, 1993). 

The equation [9] can also be expressed in terms of count rate 
and integration time, as follows: 

S/N= , (11) 

And in line with the text above, each terms of equation |9] 
can be expressed as follows: 

Gc 



N = Ci- 



Ns = C2 ■ ^J2Csky 



(12) 



(13) 



ND = Ci- 5.86 ■ 10^ • ri • exp-^ 

ZK 1 



(14) 
(15) 



1108 + T 

Where Ci,C2 and Cs are proporcionality constants. 
V. Bayesian inference: estimation of the 

INTEGRATION TIME 

In general, as described in |8|, a Bayesian Probability 
Density Function is a measure of our state of knowlege of 
the value of the parameter. When we acquire some new 
data, B ayes' theorem provides a means for combining the 
information about the parameter coming from the data, 
through the HkeHhood function, with the prior probabiHty, to 
arrive at a posterior probabiHty density, p{H\D^I), for the 
parameter. 

Let us be M the known model for the signal-to-noise 
ratio of a multi-wavelength measurement system which iden- 
tifies cross-matching sources, ^ the set of configurable 
parameters,!^, Dtei}, being t the integration time and Dfei the 
configurable aperture diameter of the instrument, and D the 
data of the measurement. / is the information associated to the 
model. By application of Bayesian inference, the configurable 
parameters for a established model M can be estimated as 
follows: 



pi^\D,M,I) = 



p(<&|M, J).j3(Z)|M,$,J) 
p{D\M,I) 



This equation can be written in the following way: 



p{^\D,MJ) = 



p{^\M, I).p{D\M,^,I) 
J^^p{^\MJ).p{D\M,^J)d^ 



(16) 



(17) 



Therefore it becomes evident that the denominator of [2] 
is just a normalization factor and we can focus on just the 
numerator, where p{^\M^I) is named the probability a priori 
and p{D\M, /) is the likelihood. 

Strictly speaking, as it is explained in [8], Bayesian infer- 
ence does not provide estimates for parameters; rather, the 
Bayesian solution to the parameter estimation problem is the 
full posterior PDF, p{^\D^ M, /) and not just a single point in 
the parameter space. It is useful to summarize this distribution 
and one possible candidate of the best-fit value is the posterior 
mean, 

<^>= J ^p{^\D,M)d^ (18) 

We will develop here the bayesian estimation parameter for 
the integration time t, however the consideration of more than 
one parameter is immediate and in that case multiple integrals 
will be considered for the marginalization of the corresponding 
nuisance parameters. 



A preliminary choice for the prior probability will be a 
Uniform distribution, therefore: 



(19) 



where tmax and tmin are the maximum and minimum integra- 
tion time to be defined for the observation under consideration. 

In general, the difference between the data and the model 
is called the error, therefore: 



p{D\M,t,I)=p{D^,...,DN\M,t,I) 
= p{Eu...,EN\M,t,I) 



(20) 



N 



l[p{E,\M,t,I) 



i=l 



The model here M is the one proposed infTTlhowever for the 
sake of clarity in the following expressions, we will consider 
that equation [TT] can be simplified as follows: 



S/N = C'Vi 



(21) 



Being C a constant 

The likelihood for all the channels N under consideration in 
a multi wavelength observation, can be expressed as follows: 



N 



1 



p{D\t,MJ) = l[—=exp 



-{d,-CVtf 



(22) 



z=l ^ 

Now we can compute p{t\D^ M, /) as indicated in [T71 and 
considering a normalizing unity constant for the denominator 
we obtain the following result:. 

p{t\D, M, I) = p{t\M, I)p{D\M, t, I) (23) 

VI. Conclusion 

The main purpose of the methodology presented here is to 
enable the capability of an observational system to adapt its 
configurable parameters depending on the results from previ- 
ous observations. In this way. It is clear that the integration 
time has a considerably strong impact on the faint sources 
detections, therefore the approach covered in the previous 
chapter of this paper intends to infer the optimum integration 
time in order to increase the probability of detection for the 
next observation to be planned and defined. The inclusion in 
the model of the main contributions of noise leads to a more 
refined parametric inference. 

A toy example has been built in order to show some 
preliminary results derived from the approach presented in this 
paper. Figure ?? represents the data in terms of integration 
times from several observations and Figure ?? shows the 
probability density corresponding to the posterior probability 
solved as detailed in the previous section. 

A real example is planned to be implemented using a model 
of signal-to-noise ratio in line with the expression presented 
in equation [23] 
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Astronomical Model Selection 




Fig. 1. Two levels of Bayesian Inference 




Fig. 2. Toy example: integration time from different observations 




Fig. 3. Probability density corresponding to the integration time inference 



